Cache Management in Scalable Network Servers

نویسنده

Vivek S. Pai

چکیده

For many users, the perceived speed of computing is increasingly dependent on the performance of network server systems, underscoring the need for high performance servers. Cost-effective scalable network servers can be built on clusters of commodity components (PCs and LANs) instead of using expensive multiprocessor systems. However, network servers cache files to reduce disk access, and the cluster’s physically disjoint memories complicate sharing cached file data. Additionally, the physically disjoint CPUs complicate the problem of load balancing. This work examines the issue of cache management in scalable network servers at two levels – per-node (local) and cluster-wide (global). Per-node cache management is addressed by the IO-Lite unified buffering and caching system. Applications and various parts of the operating system currently use incompatible buffering schemes, resulting in unnecessary data copying. For network servers, overall throughput drops for two reasons – copying wastes CPU cycles, and multiple copies of data compete with the filesystem cache for memory. IO-Lite allows applications, the operating system, file system, and network code to safely and securely share a single copy of data. The cluster-wide solution uses a technique called Locality-Aware Request Distribution (LARD) that examines the content of incoming requests to determine which node in a cluster should handle the request. LARD uses the request content to dynamically partition the incoming request stream. This partitioning increases the file cache hit rates on the individual nodes, and it maintains load balance in the cluster.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Khnum - A Scalable Rapid Application Deployment System for Dynamic Hosting Infrastructures

In a dynamically scalable hosting infrastructure for e-business computing, servers need to be quickly allocated in order to satisfy a sudden demand for increased computing power for a hosted site. Khnum is the applications and data management component of Océano a dynamically scalable hosting infrastructure for e-business computing utilities. It is responsible for server reconfiguration and for...

متن کامل

Parallel Cache Management Protocol for Static and Dynamic Web Contents

This paper studies Random Request Distribution and Load Balancing Cache protocol (RLC) and Randomly Selected and Limited Look at neighbor (RSLL) cache algorithm based on Byte Access Frequency Factor (BAFF) for internet services with multimedia and dynamic web content. The RSLL cache algorithm considers reducing response time and achieving good load balance among web servers using limitation of ...

متن کامل

Analysis and Modeling of VoIP Servers: A Linear Programming Approach

The SIP protocol was standardized by the IETF at the application layer for initiating, managing, and terminating multimedia sessions and has been widely used as the main signaling protocol on both the Internet and VoIP networks. Most challenges in this protocol are overload and lack of proper state distribution. These challenges cause a wide range of next-generation network users to face a shar...

متن کامل

Optimal Replication Algorithm for Scalable Streaming Media in Content Delivery Networks

CDN (Content Delivery Networks) improves end-user performance by replicating web contents on a group of geographically distributed servers. However, repeatedly keeping the entire replica of the original objects into many content servers consumes too much server resource. This problem becomes more serious for the large-sized objects such as streaming media, e.g. high quality video. In this paper...

متن کامل

CX: A scalable, robust network for parallel computing

CX, a network-based computational exchange, is presented. The system’s design integrates variations of ideas from other researchers, such as work stealing, non-blocking tasks, eager scheduling, and space-based coordination. The object-oriented API is simple, compact, and cleanly separates application logic from the logic that supports interprocess communication and fault tolerance. Computations...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2001

Cache Management in Scalable Network Servers

نویسنده

چکیده

منابع مشابه

Khnum - A Scalable Rapid Application Deployment System for Dynamic Hosting Infrastructures

Parallel Cache Management Protocol for Static and Dynamic Web Contents

Analysis and Modeling of VoIP Servers: A Linear Programming Approach

Optimal Replication Algorithm for Scalable Streaming Media in Content Delivery Networks

CX: A scalable, robust network for parallel computing

عنوان ژورنال:

اشتراک گذاری